Integration Test


Combining TSL and LLM to Automate REST API Testing: A Comparative Study

Barradas, Thiago, Paes, Aline, Neves, Vânia de Oliveira

arXiv.org Artificial Intelligence

The effective execution of tests for REST APIs remains a considerable challenge for development teams, driven by the inherent complexity of distributed systems, the multitude of possible scenarios, and the limited time available for test design. Exhaustive testing of all input combinations is impractical, often resulting in undetected failures, high manual effort, and limited test coverage. To address these issues, we introduce RestTSLLM, an approach that uses Test Specification Language (TSL) in conjunction with Large Language Models (LLMs) to automate the generation of test cases for REST APIs. The approach targets two core challenges: the creation of test scenarios and the definition of appropriate input data. The proposed solution integrates prompt engineering techniques with an automated pipeline to evaluate various LLMs on their ability to generate tests from OpenAPI specifications. The evaluation focused on metrics such as success rate, test coverage, and mutation score, enabling a systematic comparison of model performance. The results indicate that the best-performing LLMs - Claude 3.5 Sonnet (Anthropic), Deepseek R1 (Deepseek), Qwen 2.5 32b (Alibaba), and Sabia 3 (Maritaca) - consistently produced robust and contextually coherent REST API tests. Among them, Claude 3.5 Sonnet outperformed all other models across every metric, emerging in this study as the most suitable model for this task. These findings highlight the potential of LLMs to automate the generation of tests based on API specifications.
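The paper's actual prompt templates are not reproduced here, but the core idea of turning an OpenAPI operation into an LLM prompt for TSL-style test design can be sketched as below. The prompt wording, the function name, and the sample spec fragment are all illustrative assumptions, not the authors' pipeline.

```python
import json

def build_test_prompt(openapi_spec: dict, path: str, method: str) -> str:
    """Build an LLM prompt asking for TSL-style test scenarios for one
    endpoint. The wording is illustrative, not the paper's template."""
    operation = openapi_spec["paths"][path][method]
    return (
        "You are a REST API test designer. Using TSL-style category "
        "partitioning, produce test scenarios (valid, boundary, invalid) "
        "for the endpoint below.\n\n"
        f"Endpoint: {method.upper()} {path}\n"
        f"Operation: {json.dumps(operation, indent=2)}"
    )

# Minimal OpenAPI fragment for demonstration purposes only.
spec = {
    "paths": {
        "/users/{id}": {
            "get": {
                "summary": "Fetch a user by id",
                "parameters": [
                    {"name": "id", "in": "path", "required": True,
                     "schema": {"type": "integer", "minimum": 1}}
                ],
            }
        }
    }
}

prompt = build_test_prompt(spec, "/users/{id}", "get")
```

In a full pipeline, the returned prompt would be sent to each candidate LLM and the generated tests scored on success rate, coverage, and mutation score, as the study describes.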


Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering

Ngo, Nghia Trung, Van Nguyen, Chien, Dernoncourt, Franck, Nguyen, Thien Huu

arXiv.org Artificial Intelligence

Retrieval-augmented generation (RAG) has emerged as a promising approach to enhance the performance of large language models (LLMs) in knowledge-intensive tasks such as those from the medical domain. However, the sensitive nature of the medical domain necessitates a completely accurate and trustworthy system. While existing RAG benchmarks primarily focus on the standard retrieve-answer setting, they overlook many practical scenarios that measure crucial aspects of a reliable medical system. This paper addresses this gap by providing a comprehensive evaluation framework for medical question-answering (QA) systems in a RAG setting for these situations, including sufficiency, integration, and robustness. We introduce the Medical Retrieval-Augmented Generation Benchmark (MedRGB), which provides various supplementary elements to four medical QA datasets for testing LLMs' ability to handle these specific scenarios. Utilizing MedRGB, we conduct extensive evaluations of both state-of-the-art commercial LLMs and open-source models across multiple retrieval conditions. Our experimental results reveal current models' limited ability to handle noise and misinformation in the retrieved documents. We further analyze the LLMs' reasoning processes to provide valuable insights and future directions for developing RAG systems in this critical medical domain.
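MedRGB evaluates real LLMs over four medical QA datasets; the toy harness below only illustrates the kind of scenario the benchmark probes, using a hypothetical question, a stub reader, and made-up documents in place of an actual model and corpus. It contrasts the standard retrieve-answer setting (evidence present) with a sufficiency/robustness check (evidence absent, distractor present), where a trustworthy reader should abstain.

```python
def stub_reader(question: str, docs: list[str]) -> str:
    """Toy stand-in for an LLM reader: answers only when the retrieved
    documents state the needed fact verbatim, otherwise abstains."""
    known_facts = {"What treats condition X?": "Drug A treats condition X."}
    fact = known_facts.get(question, "")
    if fact and any(fact in d for d in docs):
        return fact
    return "unknown"

question = "What treats condition X?"
evidence = ["Drug A treats condition X."]
distractor = ["Drug B treats condition X."]  # plausible misinformation

# Standard retrieve-answer setting: the evidence is retrieved.
answer_with_evidence = stub_reader(question, evidence)

# Sufficiency/robustness setting: only a distractor is retrieved,
# so a reliable system should abstain rather than guess.
answer_without_evidence = stub_reader(question, distractor)
```

The paper's finding is that real models often fail exactly this second case, answering confidently from noisy or misleading documents.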


Gradual Drift Detection in Process Models Using Conformance Metrics

Gallego-Fontenla, Victor, Vidal, Juan C., Lama, Manuel

arXiv.org Artificial Intelligence

Changes, planned or unexpected, are common during the execution of real-life processes. Detecting these changes is a must for optimizing the performance of organizations running such processes. Most state-of-the-art algorithms focus on the detection of sudden changes, leaving other types of change aside. In this paper, we focus on the automatic detection of gradual drifts, a special type of change in which the cases of two models overlap during a period of time. The proposed algorithm relies on conformance checking metrics to carry out the automatic detection of the changes, also performing a fully automatic classification of these changes as sudden or gradual. The approach has been validated with a synthetic dataset consisting of 120 logs with different distributions of changes, achieving better results in terms of detection and classification accuracy, delay, and change region overlapping than the main state-of-the-art algorithms.
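The abstract's idea of detecting a drift from conformance metrics and then classifying it as sudden or gradual can be sketched with a simple windowed comparison. This is a minimal stand-in, not the paper's algorithm: the thresholds, window size, and the synthetic fitness trace are all assumptions for illustration.

```python
def detect_drift(fitness, window=5, threshold=0.1):
    """Flag trace positions where mean conformance fitness drops by more
    than `threshold` between two adjacent windows (a simple stand-in for
    the paper's conformance-checking metrics)."""
    changes = []
    for i in range(window, len(fitness) - window):
        before = sum(fitness[i - window:i]) / window
        after = sum(fitness[i:i + window]) / window
        if before - after > threshold:
            changes.append(i)
    return changes

def classify(changes):
    """Group consecutive change points; a wide transition region suggests
    a gradual drift, an isolated point a sudden one."""
    runs = []
    for i in changes:
        if runs and i == runs[-1][-1] + 1:
            runs[-1].append(i)
        else:
            runs.append([i])
    return ["gradual" if len(r) > 1 else "sudden" for r in runs]

# Synthetic fitness trace: stable, then a period where the old and new
# process variants overlap (gradual degradation), then stable again.
trace = [1.0] * 10 + [0.9, 0.8, 0.7, 0.6] + [0.5] * 10
kinds = classify(detect_drift(trace))
```

Because the degradation is spread over several positions rather than a single step, the change points form a contiguous run and the sketch labels the drift gradual.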


GitHub - aws/sagemaker-python-sdk: A library for training and deploying machine learning models on Amazon SageMaker

#artificialintelligence

SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker. With the SDK, you can train and deploy models using popular deep learning frameworks Apache MXNet and TensorFlow. You can also train and deploy models with Amazon algorithms, which are scalable implementations of core machine learning algorithms that are optimized for SageMaker and GPU training. If you have your own algorithms built into SageMaker compatible Docker containers, you can train and host models using these as well. For detailed documentation, including the API reference, see Read the Docs.
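A minimal sketch of the train-and-deploy flow the README describes, using the SDK's TensorFlow estimator. The role ARN, S3 path, instance types, and version strings are placeholders (check the SDK docs for currently supported framework versions), and a local `train.py` training script is assumed; running this requires AWS credentials and incurs costs.

```python
from sagemaker.tensorflow import TensorFlow

# Placeholders: supply your own execution role ARN and S3 data location.
estimator = TensorFlow(
    entry_point="train.py",          # your training script
    role="arn:aws:iam::123456789012:role/SageMakerRole",
    instance_count=1,
    instance_type="ml.m5.xlarge",
    framework_version="2.13",
    py_version="py310",
)

# Launches a managed training job on SageMaker.
estimator.fit("s3://my-bucket/training-data")

# Hosts the trained model behind a real-time endpoint.
predictor = estimator.deploy(initial_instance_count=1,
                             instance_type="ml.m5.large")
```

The same pattern applies to the other framework estimators and to Amazon's built-in algorithms mentioned above.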


What is MLOps?

#artificialintelligence

Ever liked something on Instagram and then, almost immediately, seen related content in your feed? Or searched for something on Google and then been spammed with ads for that exact thing moments later? These are symptoms of an increasingly automated world. Behind the scenes, they are the result of state-of-the-art MLOps pipelines. We take a look at MLOps and what it takes to deploy machine learning models effectively. We start by discussing some key aspects of DevOps.


Effective Testing for Machine Learning (Part I)

#artificialintelligence

Update: Part II is out now! This blog post series describes a strategy I've developed over the last couple of years to test Machine Learning projects effectively. Given how uncertain ML projects are, this is an incremental strategy that you can adopt as your project matures; it includes test examples to give a clear idea of how these tests look in practice, and a complete project implemented with Ploomber is available on GitHub. By the end of the post, you'll be able to develop more robust ML pipelines. Testing Machine Learning projects is challenging. Training a model is a long-running task that may take hours and has a non-deterministic output, which is the opposite of what we need to test software: quick and deterministic procedures. One year ago, I published a post on testing data-intensive projects to make Continuous Integration feasible.
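One common way to tame the non-determinism the post mentions is to seed the random number generator and run a fast "smoke test" on tiny data. The trainer below is a toy, not the post's code: it fits y = w*x by stochastic gradient descent, and the seed makes repeated runs bit-for-bit reproducible.

```python
import random

def train(data, seed=0, epochs=200, lr=0.05):
    """Fit y = w*x by stochastic gradient descent. Seeding the RNG makes
    the normally non-deterministic training step reproducible, so it can
    be covered by a quick, deterministic test."""
    rng = random.Random(seed)
    w = rng.uniform(-1, 1)
    for _ in range(epochs):
        x, y = rng.choice(data)
        w -= lr * 2 * (w * x - y) * x  # gradient of squared error
    return w

# Smoke test on tiny data: fast, and deterministic because of the seed.
tiny = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
w1 = train(tiny, seed=42)
w2 = train(tiny, seed=42)
```

A test can now assert both reproducibility (`w1 == w2`) and a loose quality bound, without waiting hours for a full training run.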



How Uber Implements CI/CD Of Machine Learning Models

#artificialintelligence

The ride-hailing giant Uber is currently present in 10K cities across 71 countries, and its platform is used by 93 million customers and 3.5 million drivers globally. Every quarter, the platform completes nearly 1.44 billion trips. However, as a result of the global pandemic and travel restrictions, the total number of quarterly Uber trips decreased by 24.21% in 2020. "At Uber, we have witnessed a significant increase in ML adoption across various organisations and use-cases over the last few years," said the company in its latest blog post, co-authored by Yi Zhang, Joseph Wang, Jia Li, and Yunfeng Bai. The post further highlighted various pain points and explained how Uber implemented continuous integration (CI) and continuous deployment (CD) of machine learning models to address them.


HBO Max mocked and consoled after sending odd 'integration test' email – as it blames message on intern

The Independent - Tech

HBO Max has been mocked and consoled after sending out an unusual email to its customers. The message, apparently sent to a significant number of the service's subscribers, was not advertising a new show or feature, but included only a cryptic line that appeared to have been sent out by mistake: "This template is used by integration tests only." As recipients opened the email and quickly realised that it had been sent in error, the reaction ranged from mockery to sympathy for the person who had clearly sent an internal test email out to potentially millions of subscribers. Many joked that the integration test email sounded like a show that could be on the service.


Good Software Engineering Practices for Data Scientists

#artificialintelligence

There are no hard and fast rules for how you must approach a problem or how you should implement it; however, there are certain standards. Often you will be working on a team, or contributing to an open-source project where many others work on the same program with you. Your code might even be used as production code, so there need to be standards to follow. Data scientists come from different backgrounds.
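As one small, hypothetical example of the kind of standard the post has in mind: a short, pure function with type hints and a docstring is far easier for teammates to review, reuse, and test than an inline notebook cell.

```python
def normalize(values: list[float]) -> list[float]:
    """Scale values linearly to the [0, 1] range.

    A small, pure function with type hints and a docstring: the kind of
    shared convention that keeps team and open-source code readable and
    safe to promote to production.
    """
    lo, hi = min(values), max(values)
    if lo == hi:
        # Avoid division by zero when all values are identical.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]
```

Because the function has no hidden state, a one-line unit test (e.g. `normalize([2.0, 4.0, 6.0]) == [0.0, 0.5, 1.0]`) fully exercises it.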